-
Notifications
You must be signed in to change notification settings - Fork 919
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[JNI] rmm based pinned pool #15219
[JNI] rmm based pinned pool #15219
Conversation
Signed-off-by: Alessandro Bellina <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we care about available bytes, we could track it in this class. But if it's not something we use in the plugin I don't think it is required.
Otherwise this looks good to me. One minor nit.
/ok to test |
One thing that could improve the tests is to actually write something to the buffers that are returned. |
/ok to test |
@jbrennan333 I added a test in my latest commit. Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good to me.
long available = PinnedMemoryPool.getTotalPoolSizeBytes(); | ||
if (available < size) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(nit) Is this size check really needed at all? Doesn't hurt anything though, so just a nit.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah it's a little weird, but we can keep calling this function and it will shutdown/recreate it as needed, so I don't see a huge need to change it right now, plus it's pre-existing code that continues to work.
/ok to test |
/ok to test |
Unrelated python failure:
|
/merge |
Part of #14782.
This PR removes our old implementation of the java based pinned memory pool and replaces it with a jni layer on top of
rmm::pool_memory_resource<rmm::pinned_host_memory_resource>
This PR does NOT set the default cuIO pinned host resource. That is happening after this PR goes in #15079. We'll need a follow on PR to change
PinnedMemoryPool.initialize
method to add an argument to set the cuIO pinned host resource.I have run with this and version of it that are shared with cuIO and I can't find regressions in NDS at SF3K.
Note that we don't align anymore on our side. RMM is doing the same alignment we were doing before, using
std::max_align_t
.Note also that the rmm pool doesn't have a quick way to find out what the current size is. So we had some tests that were asserting for this, and I have removed the asserts. If we would like to get that back I am happy to work with RMM to figure out how to do that.